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35 Abstract 

36 Genetic recombination has frequently been observed in coronaviruses. Here, we sequenced 

37 multiple complete genomes of dromedary camel coronavirus HKU23 (DcCoV-HKU23) from 

38 Nigeria, Morocco and Ethiopia and identified several genomic positions indicative of cross 

39 species virus recombination events among other Betacoronaviruses of the subgenus 

40 Embecovirus (clade A (3-CoVs). Recombinant fragments of a rabbit coronavirus (RbCoV- 

41 HKU14) were identified at the hemagglutinin esterase gene position. Homolog fragments of a 

42 rodent CoV were also observed at the 8.9 kDa open reading frame 4a at the 3’ end of the 

43 spike gene. The patterns of recombination varied geographically across the African region, 

44 highlighting a mosaic structure of DcCoV-HKU23 genomes circulating in dromedaries. Our 

45 results highlighted active recombination of coronaviruses circulating in dromedaries and is 

46 also relevant to the emergence and evolution of other Betacoronaviruses including MERS- 

47 coronavirus (MERS-CoV). 

48 

49 Importance 

50 Genetic recombination is often demonstrated in coronaviruses and can result in host range 

51 expansion or alteration in tissue tropism. Here, we showed interspecies recombination events 

52 of an endemic dromedary camel coronavirus HKU23 with other clade A Betacoronaviruses. 

53 Our results supported the possibility that the zoonotic pathogen, MERS-CoV, which also co- 

54 circulates in the same camel species, may have undergone similar recombination events 

55 facilitating its emergence or may do so in its future evolution. 

56 

57 Introduction 

58 Emerging infectious disease outbreaks usually arise by inter-species jumps of viruses 

59 between animal species, sometimes including humans. Coronaviruses have repeatedly made 

60 species jumps between animal species (e.g. SADS coronavirus from bats to swine) (1) and 

61 from animals to humans (2). Two human coronaviruses (HCoVs) 229E and OC43 now 

62 endemic in the human population emerged from camels and bovines respectively, within the 

63 past few hundred years (2, 3). SARS coronavirus emerged from bats via intermediate 

64 mammalian hosts in live game animal markets in Guangdong to spread to over 25 countries 

65 across 5 continents sickening almost 8000 people and leading to almost 800 deaths(4-6). The 
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66 ability of coronaviruses to make inter-species jumps is facilitated by complex virus-host 

67 interactions. High frequency of virus genetic recombination is one strategy for the virus to 

68 adapt to new host. 

69 Virus genetic recombination is frequently observed in coronaviruses and other positive sense 

70 RNA viruses. Murine hepatitis virus (MHV), a clade A P-CoV, is a well-studied example of 

71 homologous recombination, with up to 25% of its progeny in infected cells being 

72 demonstrated to be recombinants (7). High frequency of recombination is believed to be 

73 contributed by the large genome size, the intrinsic template-switching property of the viral 

74 RNA-dependent RNA polymerase (RdRp) during replication and the abundance of 

75 subgenomic RNA strands for template switching (8, 9). The role of RdRp in RNA 

76 recombination has been shown in poliovirus where a single amino acid residue mutation in 

77 the RdRp of poliovirus can result in a decrease in RNA recombination frequency (10). The 

78 exoribonucleases (ExoN) activity in replicase nonstructural protein (nsp) 14 of CoVs that 

79 constitutes the proofreading activity of genome replication has been suggested to be a 

80 potential regulator of RNA recombination (11). The presence of group-specific genes in 

81 CoVs is assumed to be a result of heterologous recombination, which involves exchange of 

82 non-homologus viral or cellular RNAs. The hemagglutinin esterase (HE) gene that is only 

83 expressed in clade A P-CoVs is believed to be acquired from influenza C virus through such 

84 heterologous recombination (12). 

85 In 2012, a novel respiratory pathogen, Middle East respiratory syndrome coronavirus 

86 (MERS-CoV), was isolated from a patient with severe respiratory illness in Jeddah, Saudi 

87 Arabia (13). It was a zoonotic virus found in dromedary camels which occasionally transmits 

88 to human (14-16). MERS-CoV is enzootic in dromedary camels in the Middle East as well as 

89 Africa with the greatest virus diversity found in Africa (17). Circulation of multiple lineages 

90 of clade B MERS-CoV in dromedary camels eventually resulted in a recombinant lineage 5 

91 virus that caused major outbreaks during 2015 both within Saudi Arabia and in South Korea, 

92 following introduction of the virus by a returning traveler (18). Recent studies have shown 

93 that two other coronaviruses, an alphacoronavirus dromedary camel coronavirus 229E 

94 (DcCoV-229E), and a P-CoV dromedary camel coronavirus HKU23 (DcCoV-HKU23) co- 

95 circulate in dromedaries in Saudi Arabia (18, 19). The co-circulation of at least three 

96 coronaviruses within camels provides an opportunity for the emergence of novel emerging 

97 infections via recombination. It is therefore important to investigate for evidence of 

98 recombination between coronaviruses co-circulating in dromedary camels because this may 
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99 have contributed to the emergence MERS-CoV and to future emergence of viruses of 

100 zoonotic and epidemic potential. Here we report the genetic diversity of DcCoV-HKU23 in 

101 the African camel population and identify several recombination events that have taken place 

102 with clade A |3-CoVs, including bovine coronavirus (BCoV) and more distant species such as 

103 rabbit coronavirus (RbCoVHKU14) and rodent coronavirus (RodentCoV). We carried out 

104 our studies in West (Nigeria), East (Ethiopia) and North (Morocco) Africa because over 70% 

105 of the global population of dromedaries are found in Africa and this is likely where the 

106 greatest diversity of these dromedary coronaviruses is likely to be manifest and where 

107 MERS-CoV emerged. 

108 

109 Methods & materials 

110 Sample collection. 

111 Nasal swabs and sera were collected from dromedary camels sampled in Nigeria, Morocco 

112 and Ethiopia in previous studies of MERS-CoV in 2015 and 2016 (17, 20). Camel nasal 

113 swabs were collected from a camel abattoir in Kano, Nigeria (n=2529) in 2015 and 2016 (17, 

114 21), from dromedary herds and abattoirs in Morocco (n=1569) in 2015 and 2016 and Ethiopia 

115 (n=621) in 2015 (20) (Table 1). The camels from Morocco and Ethiopia were mostly raised 

116 for meat, milk production or transport. Camel sera were concurrently collected from the 

117 abattoir in Nigeria (n=150) and from abattoirs and farms in both Ethiopia (n=100) and 

118 Morocco (n=100). Sampled camels were aged from 1 month-old to 20 years-old (median age 

119 of 3 years). 

120 

121 DcCoV-HKU23 detection and genome sequencing. 

122 Total nucleic acid was extracted from swab samples using EasyMAG (bioMerieux, France) 

123 system. RNA was reverse transcribed into cDNA with random hexamers using PrimeScript™ 

124 RT reagent Kit (Perfect Real Time) (Takara, Japan), according to manufacturer’s protocol. 

125 cDNAs were screened for DcCoV-HKU23 using a broad-range pancoronavirus nested PCR 

126 assay designed to detect known and unknown Co Vs targeting the consensus region of the 

127 RNA-dependent RNA-polymerase (RdRP) gene (22). RT-PCR positive amplicons were 

128 purified using the ExoSap-IT® reagent (USB, USA) and Sanger sequenced to identify the 

129 CoV identity. Samples with DcCoV-HKU23-like virus sequences were identified and 
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130 subjected to viral load quantification using a reverse transcription quantitative PCR (RT- 

131 qPCR) assay. Oligonucleotide sequences were designed to target both the N gene of DcCoV- 

132 HKU23 and Bovine CoV (Forward primer: 5’-GTCAATACCCCGGCTGAC-3’, Probe: 5’- 

133 (FAM)TCGGGACCCAAGTAGCGATGAGGC(BHQ)-3’ and Reverse primer: 5’- 

134 AACCCTGAGGGAGTACCG-3’). RT-qPCR reaction was performed using the TaqMan® 

135 Fast Virus 1-Step Master Mix (Thermo Fisher Scientific, USA), with the cycling protocol: 5 

136 min at 50°C for reverse transcription, followed by 20 seconds at 95°C and 40 cycles of 3 

137 seconds at 95°C and 30 s at 60°C. Samples with low cycle threshold (ct) values were selected 

138 for full genome sequencing. Reverse transcription with F1KU23 specific primers targeting 

139 different regions of the genome were used to generate cDNA, which were subsequently 

140 amplified by PCR with primers designed to generate overlapping amplicons that can cover 

141 the whole genome. The primer sequences are available upon request. PCR amplicons from 

142 each sample were pooled for next generation sequencing and processed with Nextera XT 

143 library preparation kit following the protocol provided by the manufacturer. Sequencing was 

144 performed using the lllumina MiSeq instrument with read length of paired ends of approx. 

145 300bp. Raw sequence reads generated were mapped to a reference DcCoV-HKU23 genome 

146 (KF906250.1) using BWA (23). Sequence of the target virus was generated by taking the 

147 majority consensus of the mapped reads with sequencing coverage at each position of higher 

148 than 100 times. 

149 

150 Genomic and Phylogenetic analysis. 

151 Opening reading frames (ORFs) of the virus genome encoding for proteins were predicted 

152 using ORFfinder (NIH, USA). Full genome of DcCoV-FlKU23 with previous sequences from 

153 Saudi Arabia, Bovine CoV and human CoV OC43 were aligned using MAFFT. Gaps and 

154 poorly aligned regions in the alignment were manually edited. Pairwise genetic distances 

155 were calculated using MEGA 7 (24). Phylogenetic analysis of DcCoV-FlKU23 was 

156 performed by maximum likelihood method using IQ-Tree, version 1.6.8 (25). 

157 Recombination analysis was performed using Simplot version 3.5.1 (26). Bootscan analysis 

158 for a recombination event was performed on an alignment of the genome sequences as 

159 described above, with a 50% consensus sequence of 4 DcCoV-FlKU23 in Nigeria with the 

160 same genotype C2/C 0ut iiei/C3 (NV1010, NV1092, NV1097 and NV1385) as the query 
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161 sequence. A sliding window of 600 nucleotides and a step of 100 nucleotide was used as the 

162 scanning setting. 

163 

164 Microneutralization assays. 

165 Heat-inactivated (56°C for 30 minutes) camel sera were first diluted 1:10, then serially two- 

166 fold diluted and mixed with equal volumes of virus at a dose of 200 50% tissue culture 

167 infective doses (TCID 5 o) of DcCoV-HKU23 isolate 368F (27). After lh of incubation at 37°C, 

168 35 pF of the virus-serum mixture was added in quadruplicate to HRT-18G cell monolayers 

169 in 96-well microtiter plates. After 1 h of adsorption, the virus-serum mixture was removed 

170 and replaced with 150 pL of virus growth medium to each well. The plates were incubated 

171 for 5 days at 37 °C in 5% C02 in a humidified incubator. Cytopathic effect was observed at 

172 day 5 post-inoculation. The highest serum dilution protecting >50% of the replicate wells was 

173 denoted the neutralizing antibody titer. A virus back titration of the input virus was included 

174 in each batch of tests. 

175 

176 Result 

177 Screening of DcCoV-HKU23 in African camels by rt-qPCR and microneutralization 

178 assay. 

179 Nasal swab samples of dromedary camels in Nigeria (n=2529), Morocco (n=1569) and 

180 Ethiopia (n=621) were tested for coronaviruses using the pan-CoV RT-PCR and identified by 

181 sequencing PCR amplicons (Table 1). (22). The overall prevalence of HKU23 viruses at each 

182 location ranged from 0.4% of 1569 samples tested in Morocco to 2.2% of 2529 from Nigeria 

183 (Table 1). A DcCoV-HKU23-specific quantitative real-time RT-PCR assay was subsequently 

184 performed to identify samples with a high viral RNA copy number for whole genome 

185 sequencing. 

186 In Morocco, DcCoV-HKU23 RNA positivity in young camels aged <2 years (n=584) was 

187 1.0% and not significantly different from adults (n=577) with 0.17% positive (Fisher’s exact 

188 test, P=0.124). In Nigeria, DcCoV-HKU23 RNA positivity in young camels was 4.1% 

189 (n=194) compared to 2.0% in adults (n=2335) (Fisher’s exact test, P=0.0674). In Ethiopia, 

190 young camels (n=136) had 1 positive swab while adults (n=314) had 5 positive swabs, which 
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191 was also without significant difference (Fisher’s exact test, P=0.673). Swab specimens were 

192 collected during the months October - April with virus detection in most months (Table 2). 

193 To study the seroprevalence of Dc-CoV-HKU23 in African camels, dromedary sera were also 

194 collected from a subset of camels during the same sampling occasions and were tested by 

195 micro-neutralization assay. A high seroprevalence was detected in dromedary camels in all 

196 three countries, with a seroprevalence of 92% of 150 sera in Nigeria, 91% of 100 sera in 

197 Ethiopia and 79% of 100 sera in Morocco respectively (Table 3). A lower seropositive rate 

198 was observed in younger (<2 years) compared to older Moroccan camels from abattoirs (48% 

199 vs. 92%; Fisher’s exact test, P=0.0036) and farms (76% vs. 100%; Fisher’s exact test, 

200 P=0.0223). There was no marked difference in seroprevalence of young and old camels in the 

201 Nigerian abattoir or in abattoirs or farms in Ethiopia. 

202 Cross-neutralizing antibody response of DcCoV-HKU23 and BCoV were evaluated by 

203 testing camel sera with high, medium, low and no neutralizing DcCoV-HKU23 titers by 

204 neutralization tests with BCoV-Mebus strain. There was significant correlation between titers 

205 of DcCoV-FlKU23 and BCoV, suggesting likely serological cross-reactivity between the two 

206 viruses (Figure 1). 

207 Evolutionary divergence and genetic diversity of DcCoV-HKU23. 

208 Full genomes of DcCoV-FlKU23 were obtained from four swab-samples in Nigeria (NVT010, 

209 NV1092, NV1097 and NV1385) and one sample each from Morocco (CAC2586) and 

210 Ethiopia (CAC1019). The African virus genomes were found closely related with pairwise 

211 base substitutions per site below 0.0270 (Table 4). These full genomes were compared with 

212 those previously reported from Saudi Arabia (18). DcCoV-FlKU23 in the African region 

213 differed from those from Saudi Arabia by a range of 0.0223 - 0.0270 pairwise base 

214 substitutions per site, comparable to the divergence observed among the regions in Africa. 

215 Compared to other closely related species within clade A P-CoVs, they were distanced to 

216 BCoV by a range of 0.0249 - 0.0300 pairwise base substitutions per site and to HCoV-OC43 

217 by a range of 0.0445 - 0.0468 pairwise base substitutions per site. The closest species related 

218 to HCoV-OC43 remains BCoV, rather than DcCoV-HKU23. 

219 

220 The genetic diversity of DcCoV-HKU23 across Africa and the Middle East was studied 

221 based on the ORFlab gene by a distance plot using SSE version 1.3 (28) (Figure 2). As 
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222 recombination has previously been shown to increase progressively from the 5’ to the 3’ end 

223 of the genome (29), ORFlab was selected to study the genetic diversity with minimal 

224 confounding by recombination. Along the position of ORFlab gene, a mean pairwise 

225 distance of about 0.01 was observed within the 6 DcCoV-FlKU23 sequences from Africa and 

226 the 4 reference sequences available from Saudi Arabia. The observed diversity of DcCoV- 

227 HKU23 was comparable to BCoV, suggesting both viruses were introduced into their animal 

228 hosts at similar points in time. Another circulating CoV in the same camel populations, 

229 MERS-CoV, was included in the analysis and showed a diversity of about 0.004 by the same 

230 analysis (Figure 2), relatively lower compared to DcCoV-HKU23. 

231 

232 Phylogenetic analysis of DcCoV-HKU23 with BCoV and HCoV-OC43 sequences. 

233 To infer the phylogenetic relationship of the newly identified African DcCoV-HKU23 with 

234 previously reported DcCoV-FlKU23 viruses from Saudi Arabia and bovine coronaviruses 

235 which are closely related, phylogenetic trees based on the complete coding sequences of 

236 RdRp (2783 nt), Spike (4101 nt) and nucleocapsid (1347 nt) gene were constructed. In 

237 addition to the 6 full genomes, four more virus sequences with complete RdRp, S, and N 

238 genes of DcCoV-HKU23 from Ethiopia (CAC1320, CAC1452) and Morocco (CAC2505, 

239 CAC2753), were obtained and included in the analysis. Using the genotyping nomenclature 

240 previously described for HCoV-OC43 and BCoV (30, 31) as a reference-point, our sequences 

241 of DcCoV-HKU23 in this study were mapped into the 3 main sub-clusters of BCoV, namely 

242 Cl, C2 and C3, in which Cl contains BCoVs from the Americas, C2 contains BCoVs from 

243 Europe and C3 contains the prototype BCoV (Figure 3). In the phylogenetic analysis of the 

244 RdRp gene, all the African and Saudi DcCoV-HKU23 clustered within clade C2, which 

245 includes BCoVs from Europe. African DcCoV-HKU23 sequences did not form a 

246 monophyletic clade with Saudi Arabia strain, instead these sequences were scattered within 

247 clade C2, suggesting a multiple ancestral origin of DcCoV-HKU23 across different 

248 geographic areas. 

249 The analysis of the spike gene showed that the 8 African and Saudi DcCoV-HKU23 were 

250 clustered together and grouped into a clade distinct from BCoV, which we designated as 

251 clade COutlier and has a basal phylogenetic relationship to the BCoV clades. The phylogeny of 

252 the viruses was geographically structured so that most of the sequences were grouped within 

253 the COutlier clade into subclades of Saudi Arabia, Ethiopia and Western / Northern Africa 
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254 (Nigeria and Morocco) viruses. The phylogenetic tree of the spike gene resembles the region 

255 dependent diversity of MERS-CoV as observed in these camels, in which viruses from Africa 

256 and Middle East were grouped into two separate clades (17). However, two sequences from 

257 Morocco (CAC2505 and CAC2753) were distinct from other sequences and fell into the 

258 clade C2 of BCoVs, sharing a common ancestor with a cluster of BCoVs from France. 

259 The phylogeny of the N gene of DcCoV-HKU23 was more diverse with virus sequences 

260 being distributed in BCoV clade C3 which included 4 Nigerian sequences (NV1010, NV1092, 

261 NV1097 and NV1385), one Moroccan sequence (CAC2586) and two sequences from 

262 Ethiopia (CAC1320). These sequences clustered together monophyletically and were related 

263 to the human enteric coronavirus strain 4408. The other 3 sequences, Ethiopia CAC1019 and 

264 Morocco (CAC2753, CAC2505) were grouped within BCoV clade C2 together with the 

265 strains from Saudi Arabia. 

266 Combining the clade classification of these three gene regions, there were 3 circulating 

267 genotypes of DcCoV-HKU23, viz C2/C2/C2; C2/Co u tiiei-/C2 and C2/Comiier/C3 (Table 5) 

268 suggesting multiple genetic recombination occurred in the past. This contrasts with BCoV 

269 which does not appear to exhibit such genetic instability (31). However, there is a lack of 

270 BCoV sequence data from Africa. These recombinant genotypes C2/Co u tiie/C2 and 

271 C2/Coutiier/C3 were observed in DcCoV-HKU23 across the African region without a distinct 

272 geographic pattern. Genotype C2/Co u tiier/C2 was observed in one sample from Saudi Arabia 

273 and one from Ethiopia, while C2/Co u tWC3 were observed in 4 samples from Nigeria, one 

274 from Ethiopia and one from Morocco. The BCoV genotype C2/C2/C2 was observed in two 

275 Moroccan strains (CAC2753, CAC2505), possibly suggesting a direct spill-over of BCoV 

276 genotype C2 to the camel population. 

277 

278 Distinct genetic region upstream of NS5a among DcCoV-HKU23. 

279 The genomic organization of African DcCoV-HKU23 is almost identical to Saudi Arabia 

280 DcCoV-HKU23 and BCoV, except for a 400nt region between S gene and NS5a that was 

281 found to be highly divergent among DcCoV-HKU23 and other clade A P-CoVs (Figure 4). In 

282 BCoV, this region contains 2 ORFs (4a and 4b) that encode a size of 4.9 kDa and 4.8 kDa 

283 non-structural proteins respectively. The absence of ORF4a and 4b in HCoV-OC43 suggested 

284 they are not essential for viral replication (32). Pairwise comparison of this region among all 

285 DcCoV-HKU23 with BCoV-DB2, RbCoV HKU14 and HCoV-OC43 revealed nonsense 
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286 mutations in both ORF4a and 4b in DcCoV-HKU23 from Saudi Arabia and Africa, resulting 

287 in premature stop codons and a truncated protein. A 200nt deletion was observed in NV1385 

288 after the premature stop codon of ORF4a. Similar, though not identical patterns of deletion 

289 were also found in RbCoV-FtKU14 and FlCoV-OC43 that resulted in more truncated protein 

290 sequences. Although these protein sequences are varied, the nucleotide sequences of DcCoV- 

291 HKU23/362F, DcCoV-HKU23/CAC1019, RbCoV-HKU14 and HCoV-OC43 in fact share a 

292 high pairwise similarity with the BCoV sequences that contain both full length ORF4a and 4b. 

293 Interestingly, DcCoV/NV1097 and DcCoV/CAC2586 contains a distinct ORF4a that encodes 

294 a 8.9 kDa non-structural protein among other DcCoVs in this region. Flomology of this 

295 protein was BLAST searched and mapped to another 8.6 kDa non-structural protein encoded 

296 by a Rodent CoV RtMm-CoV-l/IM2014 (Accession No.: KY370052.1) with about 60% 

297 amino acid similarity (Figure 4b). The BCoV ORF4a and 4b have previously been suggested 

298 to be counterparts of the 11 kDa non-structural protein in mouse hepatitis virus (MHV) (33). 

299 The discovery of this rodent-like ORF4a encoded in DcCoV-HKU23 illustrated a possible 

300 homologous recombination with rodent coronaviruses that highlighted its distinct 

301 evolutionary history as compared to BCoV. 

302 Recombination analysis with other clade A p-CoVs. 

303 To study the possibility of cross species recombination, Bootscan analysis of full genomes of 

304 DcCoV-FlKU23 in Africa with other clade A p-CoVs was performed using Simplot, version 

305 3.5.1. A multiple sequence alignment of the 6 African stain DcCoV-HKU23 with RbCoV- 

306 HKU14, PHEV, BCoV-DB2, EqCoV-NC99, RodentCoV-IM2014, DcCoV-HKU23 from 

307 Saudi Arabia and HCoV-OC43 was made. Using the Nigeria strain DcCoV-HKU23 as the 

308 query, recombination signals were observed with BCoV-DB2 at the NS2a gene (position 

309 21901-22600), with RbCoV-HKU14 at the hemagglutinin esterase (HE) gene (position 

310 22601-23600) and with the RodentCoV-IM2014 at the region of ORF4a, 4b and NS5a gene 

311 (position 27901-28800) (Figure 5a). Phylogenetic analysis of the BCoV signal region showed 

312 the BCoV-DB2 clustered with the group of DcCoV-HKU23 from Africa (Figure 5b). The 

313 signal at NS2a gene extended the region showing the mixing of BCoV with DcCoV-HKU23, 

314 in addition to the RdRp and N gene. The tree of the RbCoV signal region showed the 

315 RbCoV-HKU14 changed its phylogenetic position and linked to the cluster of DcCoV- 

316 HKU23 from Nigeria and Ethiopia. This clustering suggests a recombination event between a 

317 common ancestor of DcCoV-HKU23 sequences from Nigeria or Ethiopia with the RbCoV- 

318 HKU14 or a RbCoV-like virus. The RodentCoV signal at the region of ORF4a and 4b 


D 

o 


Downloaded from http://jvi.asm.org/ on September 23, 2019 at ECCLES HEALTH SCIENCES 



cfWology JVI Journal of Virology Accepted MdnUSCiTpFPosted Oflllhe 


319 supported the homologous recombinant of West African DcCoV-HKU23 with a RodentCoV 

320 in the genetic organization analysis. A tree plotted from position 27901-28800 showed 

321 DcCoV-HKU23 were split into two separate evolutionary pathways (as illustrated as BCoV- 

322 like and RodentCoV-like in Figure 4a), in which West Africa DcCoV-HKU23 were clustered 

323 outgroup with RodentCoV-IM2014, while East Africa DcCoV-HKU23 were clustered with 

324 BCoV. 

325 

326 Discussion 

327 Our data provides an enhanced understanding of the diversity and circulation of an endemic 

328 DcCoV-HKU23 in African dromedaries in East (Ethiopia), West (Nigeria) and North 

329 (Morocco) Africa in comparison with viruses in Saudi Arabia and for the evolutionary 

330 relationships between HKU23 and BCoV, an important pathogen of cattle (34). In this study, 

331 F1KU23 viral RNA was detected in 2.2% of dromedary nasal swabs tested in Nigeria, 0.5% in 

332 Morocco and 1.4% in Ethiopia respectively. The rates were comparatively higher than a 

333 previous study with year-around sampling of camel nasal swabs done in Saudi Arabia where 

334 a virus detection rate of 0.2% was reported (18). We detected DcCoV-HKU23 in most 

335 sampling months from October to April, which were the only periods of the year we 

336 investigated suggesting there was no clear seasonality of virus activity. While co-circulation 

337 of three CoVs, MERS-CoV, DcCoV-229E and DcCoV-HKU23 were reported in Saudi 

338 Arabia camels, similar virus circulation was also observed in Nigeria, with a positive rate of 

339 2.2% for MERS-CoV (21) and 1.0% for DcCoV-229E (data not published). Serological 

340 prevalence of DcCoV-HKU23 antibodies in camels were 92% in Nigeria, 91% in Ethiopia 

341 and 79% in Morocco respectively, suggesting a widespread circulation of this or an 

342 antigenically related virus over a broad geographical area across Africa, in a manner 

343 comparable with MERS-CoV (20, 35). Younger camels had lower seropositive rates than 

344 adults in Morocco, but no age related difference was observed in Nigeria and Ethiopia. 

345 Differences in DcCoV-HKU23 seroprevalence may be attributed to the variations in the 

346 husbandry practices, co-habitant animal hosts or climatic factors between these countries. 

347 Similar seroprevalence were observed in camels in abattoirs and farms, suggesting virus 

348 circulation has already been established at the farm or herd levels and did not solely reflect 

349 amplification in the camel marketing chain. There is extensive cross neutralization of both 

350 DcCoV-HKU23 and BCoV, with a trend to higher titers to DcCoV-HKU23. The high amino 
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351 acid sequence identity of the Spike protein (92% - 97%) of DcCoV-HKU23 and BCoV very 

352 likely contributed the cross neutralization between the two viruses. 

353 

354 The full genome sequences of DcCoV-HKU23 across the Africa and Saudi Arabia allowed us 

355 to study the genetic diversity of this virus in camel populations. The distance plot using the 

356 ORFlab gene can modestly evaluate the diversity within DcCoV-HKU23 sequences due to 

357 random mutations, with lesser effect contributed by recombination. The observed diversity of 

358 DcCoV-HKU23 was comparable to BCoV, suggesting both viruses have established in their 

359 hosts for a similar period of time. It is of interest to observe DcCoV-HKU23 to have a much 

360 higher diversity compared with MERS-CoV. At present there are three CoVs (Camel CoV 

361 229E, DcCoV-HKU23 and MERS-CoV) co-circulating in dromedary camels, the narrower 

362 genetic diversity of MERS-CoV possibly indicates a more recent introduction into camels or 

363 a purifying selection event in its more recent evolutionary history. 

364 Phylogenetic analysis of the full genomes of DcCoV-HKU23 in Africa with other published 

365 BCoV sequences available at Genbank identified incongruent topologies in the phylogenetic 

366 trees of RdRp, S and N, indicating events of recombination. With reference to the BCoV 

367 genotyping method described previously, DcCoV-HKU23 obtained in this study were 

368 classified into 3 genotypes: C2/C2/C2, C2/Co u tiier/C2 and C2/Co u tiier/C3. The Coutiierdade has 

369 no BCoV sequences and is a uniquely DcCoV-HKU23 clade. Two DcCoVs-HKU23 from 

370 Morocco (CAC2505, CAC2753) showed a genotype C2/C2/C2, which is a non-recombinant 

371 variant of BCoV clade2, suggesting a BCoV evolutionary origin and the possibility of a 

372 BCoV spill-over into the Moroccan camel population. This is the first time that a non- 

373 recombinant BCoV variant has been detected in camels. Previously, BCoV has been detected 

374 in a wide range of ungulate hosts, including bovine, waterbuck, sambar deer, white-tail deer, 

375 alpaca, giraffe, stable antelope, buffalo and yak (36-39). The expansion of BCoV to camel 

376 hosts further illustrates its ability to cross species and infect other similar ungulate hosts. 

377 Recently, surveillance of BCoV has been expanded into regions of East Asia, including 

378 China (40), Korea (41)and Vietnam (42), and Caribbean in Cuba (43). The Spike gene of 

379 BCoV from these regions were all phylogenetically clustered into Cl. Figure 6 summarizes 

380 the currently known geographic distribution of different genotypes of BCoV/HKU23-like 

381 viruses in camels, bovines and other species. 
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383 The phylogeny of the S gene of DcCoV exhibited a region dependent diversity that is also 

384 noted in BCoV and MERS-CoV. The Cl and C2 genotypes of BCoV correspond to the 

385 American-Asia and European cluster respectively. In this study, the C2 genotype of the two 

386 Morocco strains that clustered to BCoVs detected in France suggesting a divergence from the 

387 common ancestor of C2 genotype BCoVs. Other DcCoVs identified in this study as well as 

388 those previously identified in Saudi Arabia are recombinants, because their S gene showed an 

389 outgroup topology to BCoVs and HCoV-OC43, indicating DcCoV-HKU23 acquired its S 

390 gene through recombination with an ancestor yet to be identified. However, we cannot infer 

391 whether the recombination occurred prior or after the introduction of the virus to camels with 

392 the present data. The phylogeny of the N gene of DcCoV-HKU23 also varied among the 8 

393 recombinant sequences with Coutiier genotype in the S gene. 7 DcCoV-HKU23 sequences 

394 (NV1010, NV1092, NV1097, NV1385, CAC1320, CAC1452 and CAC2586) were grouped 

395 to C3 with the N gene tree, indicating a close relatedness to prototype BCoV in the N gene. 

396 DcCoV-HKU23/CAC1019 from Ethiopia and DcCoV-HKU23 from Saudi Arabia were 

397 grouped to the C2 European clade. These C2 genotype sequences suggest a possible 

398 recombination event between the N gene of a non-recombinant BCoV strain with a 

399 recombinant DcCoV-HKU23 strain as the backbone. Overall, DcCoV-HKU23 exhibited a 

400 broader diversity that contrasted to the genetic stability as observed in BCoV. However, a 

401 limitation in the analysis is the lack of BCoV sequence data in Africa. With more BCoV 

402 genetic datasets from Africa, the recombination events of DcCoV-HKU23 and BCoV may be 

403 more clearly resolved. 

404 

405 Cross species recombination of DcCoV-HKU23 was also observed with other clade A P-CoV 

406 species that involved a rabbit-CoV HKU14-like virus. Rabbit-CoV14 is a virus initially 

407 discovered through surveillance in China (44), but similar viruses may be present in a much 

408 wider geographic region. Bootscan analysis showed DcCoV-HKU23 from Nigeria and 

409 Ethiopia showed a recombination signal with RbCoV-HKU 14-like virus at the position 

410 22601-23600, encoding the NS2a and HE gene. The HE gene in clade A P-CoVs has been 

411 suggested to be acquired by a heterologous recombination with influenza C virus. The 

412 recombination here was located in a similar region suggesting a possible recombination 

413 hotspot. The phylogeny of the signal region showed the RbCoV sequence was linked to the 

414 DcCoV-HKU23 sequences from Ethiopia and Nigeria at a basal position, suggesting the 

415 recombination event may have occurred with the ancestral sequence from these two regions. 
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416 The HE protein encoded in clade A (J-CoVs plays a role in the receptor binding through its 

417 receptor binding or receptor destroying activity to glycan components (45). An example of 

418 the functional property of HE has been demonstrated in the adaptation of HCoV-OC43 and 

419 HKU1 to infect humans, when the HE lectin domain was progressively lost through 

420 accumulated mutations (46). The diversity of HE gene between parental and recombinant 

421 DcCoVs deserves further research to characterize the glycoconjugates targeted by the 

422 receptor destroying activity and study how such virion-glycan interactions may contribute to 

423 its host tropism. 

424 

425 We further identified multiple mutations in the downstream ORF4a and 4b encoded between 

426 S and NS5a gene that provided insight on the evolutionary origin of clade A P-CoVs. The 

427 deletion patterns of ORF4a and 4b observed in DcCoV-HKU23, RbCoV-HKU14 and HCoV- 

428 OC43 revealed a stepwise deletion among these sequences. While these patterns may suggest 

429 a BCoV-origin of these sequences, it is also possible that an ancestral virus infected multiple 

430 hosts and bovines preferentially retained those ORFs. Nonsense mutations and deletions of 

431 these ORFs in DcCoV-HKU23, RbCoV-HKU14 and HCoV-OC43 supported the contention 

432 that this region may not contain essential genetic sequences and the loss of such genetic 

433 information will not impair virus fitness in dromedaries. In fact, ORF4a and 4b in BCoV has 

434 previously been suggested as vestiges of an 11-kDa protein encoded by mouse hepatitis virus 

435 (MHV) resulting from a nonsense mutation in the middle of the ORF4 (33). The region 

436 between the S and E gene may suggest a mouse or murine CoV origin of this region. As 

437 additional evidence, we also observed another 8.9 kDa protein encoded by the ORF4a in 

438 DcCoV-HKU23 identified in this study which mapped to a similar protein in rodent CoV 

439 with 60% amino acid similarity. Bootscan analysis showed DcCoV from Morocco (CAC2586) 

440 and Nigeria (NV1010, 1092, 1097 & 1385) were phylogenetically outgrouped with rodent 

441 CoV at the position from 27901-28800, suggesting a possible homologous recombination. 

442 These sequences altogether illustrated a multiple origin of clade A p-CoVs from rodent like 

443 species. Recent surveillance of rodent species have identified many more novel CoV species, 

444 including ChRCoV-HKU24, LAMV, LRLV and rodent CoV (47-49). These sequences are 

445 phylogenetically positioned at the deep branch rooting members of clade A P-CoVs. The 

446 discovery of sequence remnants of rodent CoV in DcCoV-HKU23 further support the 

447 involvement of rodent like species in the evolutionary history of clade A P-CoVs (47). 


D 

o 


Downloaded from http://jvi.asm.org/ on September 23, 2019 at ECCLES HEALTH SCIENCES 


cfWology JVI Journal of Virology Accepted MdnUSCiTpFPosted Oflllhe 


448 The occurrence of the recombination events identified in this study requires the co-infection 

449 of two or more different CoVs in the same host and same cell and this requires the parental 

450 viruses to be co-circulating in the same geographic region. The wide host range and 

451 geographic range of BCoV-like viruses could potentially facilitate recombination with other 

452 coronaviruses. Recombination events that led to the emergence of SARS-like CoV likely 

453 occurred in its natural reservoirs in bats, and the virus then spilled over to intermediate hosts 

454 such as civets, raccoon dogs and humans. Since rodents harbor the highest diversity of clade 

455 A P-CoVs (49), one may speculate that it may be possible for recombination events to occur 

456 in rodents with the recombinant virus subsequently spilling over to other mammalian hosts if 

457 it has competitive advantages over pre-existing strains. Thus, one may speculate that the 

458 recombination of DcCoV-HKU23 with RabbitCoV-HKU14 and RodentCoV-IM2014 could 

459 have occurred in a rodent species. 

460 

461 Limitations of this study include the lack of availability of rectal swabs to evaluate the tissue 

462 tropism associated with DcCoV-HKU23 infection. Although the virus seems not to cause 

463 significant disease as it was detected in apparently healthy camels in abattoirs, it is unclear 

464 that whether infection is limited to the upper respiratory mucosa similar to MERS-CoV or 

465 whether it spreads more systemically. Specimens from sites of the body other than the upper 

466 respiratory tract may provide information on the tropism of the virus. The lack of year round 

467 sampling precludes conclusions on the seasonality of virus activity. On the other hand, the 

468 strength of the study is sampling across East, West and North Africa which allows an 

469 understanding of the virus diversity across a large geographic region. The lack of BCoV 

470 sequences from Africa precludes a more definitive analysis of the origins of the different 

471 genotypes of HKU23. 

472 In conclusion, the study showed a mosaic structure of DcCoV-HKU23 that is likely to be 

473 contributed by several recombination events among clade A P-CoVs. Among the three 

474 identified DcCoVs that circulate in dromedary camels, MERS-CoV has so far demonstrated 

475 intraspecies recombination, while DcCoV-HKU23, in addition, further demonstrated inter- 

476 species recombination. Our study highlighted the importance of studying recombination of 

477 CoVs to understand its evolutionary history and cross species transmission of coronaviruses 

478 in dromedaries. 
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650 Table 1. Screening of DcCoV-HKU23 in dromedary camels from Nigeria, Ethiopia and 

651 Morocco. 


Country 

Sampling 

year 

Sample positive / tested * 

(% positive) 

Ct range 

Field collections 
reported in Ref. 

Nigeria 

2015, 2016 

Total: 55/2529 (2.2%) 

Young camels: 8/194 (4.1%) 
Adult camels: 47/2335 (2.0%) 

18.8-36.9 

So et al,2018 

Morocco 

2015, 2016 

Total: 7/1569* (0.45%) 

Young camels: 6/584 (1.0%) 
Adult camels: 1/577 (0.17%) 

26.5-33.4 

Miguel et al, 2017 

Ethiopia 

2015 

Total: 9/621 *(1.4%) 

Young camels: 1/136 (0.74%) 
Adult camels: 5/314 (1.6%) 

23.2-30.2 

Miguel et al, 2017 


652 * Note: Age information were not available for all sampled camels. 
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654 
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655 Table 2. The monthly RNA rate of Dc-CoV-HKU23 in camels from Africa and Saudi Arabia. 


Month 

RNA rate (%) 

2014-2015 

2015-2016 

Morocco 

Ethiopia 

Nigeria 

Morocco 

May - September 

ND 

ND 

ND 

ND 

October 

ND 

ND 

1/526 (0.2%) 

ND 

November 

ND 

ND 

15/739 (2.0%) 

ND 

December 

ND 

ND 

1/35 (2.9%) 

ND 

January 

ND 

2/120 (1.7%) 

12/531 (2.3%) 

0/349 (0%) 

February 

0/195 (0%) 

7/501 (1.4%) 

26/698 (3.7%) 

ND 

March 

0/186 (0%) 

ND 

ND 

5/385 (1.3%) 

April 

ND 

ND 

ND 

2/453 (0.4%) 


656 ND, No data; 
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Table 3. The seropositive rate of Dc-CoV-HKU23 in camel sera from Nigeria, Ethiopia and Morocco by micro-neutralization assay. 


Region 

Nigeria 

Ethiopia 


Morocco 

Location type 

Abattoir 

Abattoir 

Farm 

Abattoir 


Camel age group 

Young 
(age <=2) 

Adult 
(age >2) 

Young 

Adult 

Young 

Adult 

Young 

Adult 

Young 

Adult 

Seropositive rate 

12/16 

123/134 

22/24 

22/26 

23/25 

24/25 

12/25 

23/25 

19/25 

25/25 

(75%) 

(92%) 

(92%) 

(85%) 

(92%) 

(96%) 

(48%) 

(92%) 

(76%) 

(100%) 

Median titre 

1:40 

1:20 

1:40 

1:40 

1:40 

1:40 

<1:10 

1:20 

1:20 

1:160 


Downloaded from http://jvi.asm.org/ on September 23, 2019 at ECCLES HEALTH SCIENCES 












Journal of Virology JVI Journal of Virology Accepted ManUSCTipFPoSted Onlme 


Table 4. Estimates of evolutionary divergence between the complete genome sequences of DcCoV-HKU23 identified in Africa (Nigeria, Ethiopia and 
Morocco) and Saudi Arabia, Bovine CoV and Human CoV-OC43. Analyses were conducted using the Tamura-Nei model using MEGA7. 


Host 

Country 

Strain 

Genome 

%GC 




Pairwise evolutionary divergence in the number of base substitutions per site 
















size (bp) 

NV1010 

NV1092 

NV1097 

NV1385 

C AC 1019 

CAC2586 

368F 

362F 

265F 

Ryl23 

DB2 

OC43 

Camel 

Nigeria 

DcCoV-HKU23-NV1010 

30780 

36.90% 















DcCoV-HKU23-NV1092 

30799 

36.90% 

0.0001 














DcCoV-HKU23-NV1097 

31075 

37.00% 

0.0167 

0.0168 













DcCoV-HKU23-NV1385 

30798 

36.90% 

0.0004 

0.0004 

0.0168 











Ethiopia 

DcCoV-HKU23- 

CAC1019 

31021 

36.90% 

0.0206 

0.0207 

0.0246 

0.0206 










Morocco 

DcCoV-HKU23- 

CAC2586 

31062 

37.00% 

0.0191 

0.0191 

0.0129 

0.0191 

0.0270 









Saudi 

DcCoV-HKU23-368F 

31052 

37.00% 

0.0263 

0.0265 

0.0231 

0.0263 

0.0223 

0.0225 








Arabia 

KF906251.1 

DcCoV HKU23-362F 
KF906250.1 

31052 

37.00% 

0.0263 

0.0265 

0.0231 

0.0263 

0.0223 

0.0225 

0.0000 








DcCoV-HKU23-265F 

KF906249.1 

31052 

37.00% 

0.0264 

0.0266 

0.0235 

0.0264 

0.0227 

0.0229 

0.0017 

0.0017 







DcCoV-HKU23-Ryl23 

KT368891.1 

31041 

37.00% 

0.0269 

0.0270 

0.0237 

0.0268 

0.0229 

0.0231 

0.0017 

0.0017 

0.0026 




Bovine 


BC0V-DB2 

DQ811784.2 

31007 

37.10% 

0.0299 

0.0300 

0.0288 

0.0299 

0.0249 

0.0279 

0.0197 

0.0197 

0.0200 

0.0204 



Human 


HCOV-OC43 

AY 391777.1 

30738 

36.80% 

0.0466 

0.0467 

0.0468 

0.0468 

0.0445 

0.0463 

0.0420 

0.0420 

0.0421 

0.0425 

0.0333 
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Table 5. Summary of the genotypes of DcCoV-HKU23 identified in this study. 




RdRp 

S 

N 

Genotype 

DcCoV-HKU23 Nigeria 

NV1010 

C2 

C outlier 

C3 

C2/C outlier/C3 


NV1092 

C2 

C outlier 

C3 

C2/C outlier/C3 


NV1097 

C2 

C outlier 

C3 

C2/C outlier/C3 


NV1385 

C2 

C outlier 

C3 

C2/C outlier/C3 

Ethiopia 

CAC1019 

C2 

C outlier 

C2 

C2/C outlier/C2 


CAC1320 

C2 

C outlier 

C3 

C2/C outlier/C3 


CAC1452 

C2 

C outlier 

C3 

C2/C outlier/C3 

Morocco 

CAC2505 

C2 

C2 

C2 

C2/C2/C2 


CAC2586 

C2 

C outlier 

C3 

C2/C outlier/C3 


CAC2753 

C2 

C2 

C2 

C2/C2/C2 

Saudi 

362F 

C2 

C outlier 

C2 

C2/C outlier/C2 

Arabia 






BCoV Europe 

BCoV/FRA 

C2 

C2 

C2 

C2/C2/C2 

Americas 

BCoV ENT 

Cl 

Cl 

Cl 

Cl/Cl/Cl 
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Figure 1. Scatter plot showing camel sera (n=13) with different neutralizing titres against DcCoV HKU23 
were tested for cross neutralization against BCoV. 
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Figure 2. Comparison of the genetic diversity of the ORFlab gene of DcCoV-FIKU23, BCoV and MERS-CoV. 
Sequence distance plot were generated using SSE version 1.3 using a sliding window of 250 and a step 
size of 25 nucleotides. Sequences were obtained from Genbank database and closely related sequences 
with a pairwise distance < 0.001 were excluded in the analysis. A total of 10 DcCoV-FIKU23 sequences, 

28 BCoV sequences and 88 MERS-CoV sequences were included in the analysis. 
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Figure 3. Phylogenetic analysis of a) RdRp, b) spike and c) nucleocapsid gene of DcCoV-PIKU23 from 
Nigeria (colored blue), Morocco (colored red) and Ethiopia (colored green). Reference DcCoV-HKU23 
sequences from Saudi Arabia were colored brown. Alignment of each gene is manually trimmed to 
obtain an alignment of 2769nt for RdRp, 4137nt for spike and 1347nt for nucleocapsid gene respectively. 
Tree was constructed by maximum likelihood method using IQ-Tree with the best-fit model 
automatically selected by ModelFinder. Nodes indicated bootstrap values calculated using ultrafast 
bootstrap with 1000 replicates. Trees were mid-point rooted. 


A) RdRp 
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Figure 4. A) Genomic organization of DcCoV-HKU23 and other clade A p-CoVs of the region between spike gene and NS5a. Distinct ORFs patterns were 
found in DcCoV-HKU23 in this region. Stop codons in the ORFs were labelled by black triangle. Horizontal dotted lines indicated region of deletion. B) The 
amino acid sequence alignment of the 8.9kDa ORF4a of rodent coronaviruses (RtAs-CoV/IM2014, accession no. KY370044; RtMm-CoV-l/IM2014, accession 
no. KY370052; RtMruf-CoV-2/JL2014, accession no. KY370046) and DcCoV-HKU23. 
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Figure 5. A) Recombination analysis of the genomes of DcCoV-HKU23 ; BCoV-DB2, PHEV, HCoV-OC43, RbCoV-HKU14, EquineCoV-NC99 and RodentCoV- 
RtMm-CoV-l/IM2014. Bootscan analysis was performed by Simplot, version 3.5.1, using a 50% consensus sequence of the DcCoV-HKU23 in Nigeria with the 
genotype C2/C out | ie r/C3 as the query. B) Phylogenetic trees from representative regions were constructed by maximum likelihood method using IQ-Tree, 
version 1.6.8. Trees were midpoint rooted. Accession number of the CoVs used in this analysis: DcCoV-HKU23/362F (KF906250.1), BCoV-DB2 (DQ811784.2), 
PHEV (KY994645), HCoV-OC43 (AY391777.1), RbCoV-HKU14 (JN874559), EquineCoV-NC99 (EF446615) and RodentCoV-IM2014 (KY370052). 
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Figure 6. The geographic distribution of different genotypes of BCoV/HKU23-like viruses in camels, 
bovines and other species. The map was drawn using R software. 
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